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By IEEE measures, a 2015 article on big data 


is the most influential article ever published in 


Computer. Does that surprise your 


ow do you write a highly influential article for 

Computer? It is not that difficult, if you look at 

the example of the most influential article that 

the magazine has ever published (see “Article 
Facts”). First, organize a special issue on a topic that is 
about to become prominent in both public discussion and 
the work of our members. Second, identify five authorial 
teams that address the key issues of the topic, ranging 
from the current state of research to the way that topic 
will influence other parts of computing, and end with a 
discussion of emerging applications. Finally, write a clear 
and succinct summary of the issue that explains all of the 
controversies that will surround this new topic for the 
next five years. Then get everything to the managing edi- 
tor on deadline, sign the copyright transfer forms, and you 
will have the most cited, viewed, and downloaded article 
in Computer’s back catalog. 
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It is, of course, easy to dismiss this 
strategy as relying a little too much 
on happenstance and confusing pop- 
ularity with influence. Both claims 
are serious and deserve serious at- 
tention. However, before we consider 
both of them, let us identify the arti- 
cle. The article with the most down- 
loads and citations is the editor’s introduction to the June 
2013 special issue on big data, “Big Data: New Opportunities 
and New Challenges,” by Katina Michael and Keith Miller.' 

In 2013, big data was a relatively new topic that was dis- 
cussed primarily by a small group ofinsiders. For the years 


ARTICLE FACTS 


» Article: “Big Data: New Opportunities and New 
Challenges” 

» Authors: K. Michael and K. Miller 

» Citation: Computer, vol. 46, no. 6, pp. 22-24, 
June 2013 

» Computer influence rank: #1 with 18,486 down- 
loads and 108 citations 
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prior to that date, IEEE Xplore lists only 
73 articles with the term big data in its 
title and shows that the vast majority 
of those come from specialized con- 
ferences about databases, sensors, and 
knowledge engineering. The figures 
for the Association for Computing Ma- 
chinery library are similar. 

A couple of other dates help put 
this issue in context. The Hadoop File 
System from Apache, which had be- 
come important to a number of big 
data applications, was first released 
(in version 0.1.0) in April 2006. In 2011 
and 2012, the IEEE Computer Society 
(CS) was developing a new commu- 
nity around the technology of cloud 
computing. During those discussions, 
several members noted that cloud 
computing would be an important 
technology for the emerging problems 
of big data. 

From the perspective of the authors, 
the timing of the special issue required 
little special foresight. Michael, who was 
both coauthor of the article and coedi- 
tor of the special issue, worked within 
the IEEE Society on Social Implications 
of Technology and was aware of the 
growing importance of big data. “Every- 
one was talking big data in 2012,” she 
recalled, “but just like any other new 
buzz term, there was not much actually 
written on it.” At the time, Michael was 
editor-in-chief of IEEE Technology and 
Society Magazine and “deeply concerned 
about the privacy and security chal- 
lenges” that big data posed on ordinary 
citizens. “I could see the benefits,” she 
remembered, “and the costs.” Initially 
she and Miller, her coauthor, wanted to 
callthe issue “The Social Implications of 
Big Data: Pros and Cons,” but ultimately 
agreed to the name “Big Data: New 
Opportunities and New Challenges.” 

If the article and the correspond- 
ing special issue grew out of a spe- 
cialized community, it gained its in- 
fluence by building on the strength of 
that community. To understand how 


that influence developed and how it 
differed from simple popularity, we 
need to understand how professional 
technical articles are used in the work 
of our members. Broadly speaking, 
a technical article can do one of two 
things. First, it could be part of a con- 
versation that is trying to find the 
best way to solve a technical problem. 
Second, the article can present a tech- 
nology as a developed solution for 
other problems. 

This is a gross simplification of 
the problem, and we will develop it in 
more detail as this column progresses. 


to cite articles from the same trans- 
actions or proceedings series. This is 
one way that we can see the discussion 
over the best technical solution to 
a problem. 

In contrast, the articles in Computer 
and some of the other CS magazines 
rarely cite other articles from their 
prior issues. These articles tend to cite 
articles from transactions and confer- 
ences. In doing this, they are not en- 
gaging in the discussion over the best 
way to solve a problem. They are pre- 
senting mature ideas as possible solu- 
tions to other technical solutions. 


If the article and the corresponding special 
issue grew out of a specialized community, 
it gained its influence by building on the 
strength of that community. 


However, it is a well-recognized phe- 
nomenon in the technical literature 
that is known as black-boxing. While 
a technology is new and being devel- 
oped, it is open for discussion and 
debate by the community working 
on it. At some point, that community 
concludes that it has found the best 
way of handling that technology, and 
it ceases to discuss different possible 
solutions. At that point, the technol- 
ogy is said to have been black-boxed. 
The community then goes looking for 
ways to apply the technology. 

In the CS literature, the debate over 
the best possible technical solutions 
tend to take place in conference pro- 
ceedings and transactions. The pre- 
sentation of black-boxed technologies 
tends to take place in magazines and 
books. Again, this is a gross simplifi- 
cation of how the literature works. Yet, 
we can see some evidence of these two 
kinds of technical articles in the cita- 
tion lists of the periodicals. Articles 
in transactions and proceedings tend 


So one way of assessing the im- 
portance of an article is to determine 
which kind of periodical is citing it. 
If it is being cited by proceedings and 
transactions, it is part of the discus- 
sion over the results of research; it is 
being cited in the debate about how to 
best solve a technical problem. If it is 
being cited by magazines and books, 
it is likely part of the presentation ofa 
finished technology. 

Again, while this characteriza- 
tion of articles is somewhat crude, it 
reveals an interesting fact about Mi- 
chael and Miller’s introduction from 
the 2013 special issue on big data. It 
is being used in both kinds of techni- 
cal discussions. Within the IEEE lit- 
erature, it was cited by 32 conference 
proceedings and 11 magazines. Out- 
side the IEEE, the split is about equal. 
Of the 93 citations it has garnered, 
49 are from conferences or transac- 
tions. The rest are from magazines or 
books. Taken as a whole, it is evidence 
that the article was widely read and 


MARCH 2020 75 


BODY OF KNOWLEDGE 


used in a variety of situations. It was 
cited as part of the debate over big data 
technology. It was also used to present 
the ideas of big data to both the broad 
computer community and a bigger 
technical community. The response to 
the special issue, noted Michael, “has 
really gone far beyond our wildest 
expectations.” 

Of course, one of the reasons that this 
article has been so widely circulated has 
been the fact that it was part of a well-re- 
searched and well-written special issue. 
“We hada very long list of people who we 
wanted to approach,” Michael explained. 
“People from academia of course, but 
also people working with industry, peo- 
ple from big corporations, and even 
government.” The five other articles are 
a good representation of these institu- 
tions, and they, too, have become highly 
visible in their own right. All of them 


are in the list of top articles downloaded 
from IEEE Xplore. 

Yet, perhaps the most convincing 
reason that “Big Data: New Opportuni- 
ties and New Challenges” has become 
an important article is the language 
of the text itself. At a distance of seven 
years, it describes the opportunities 
of this technology as well as its chal- 
lenges in sentences that are direct and 
clear and reflect the nature of big data 
as we now understand it. “While big 
data can yield extremely useful infor- 
mation,” the article explains, “it also 
presents new challenges with respect 
to how much data to store, how much 
this will cost, whether the data will be 
secure, and how long it must be main- 
tained.” If you are seeking to write an 
influential article for Computer, you 
will need this third step as well as the 
other two. After you have identified 


a newly important topic for a special 
issue and found good authors to com- 
plete your special issue, you will do 
well to express your ideas in a clear, 
open form that will be quoted by cur- 
rent researchers as well as those who 
are attempting to understand the new 
technologies. 
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IEEE TRANSACTIONS ON 


COMPUTERS 


Call for Papers: 
IEEE Transactions 
on Computers 


Publish your work in the IEEE Computer 
Society's flagship journal, /EEE Transactions 
on Computers. The journal seeks papers on 
everything from computer architecture and 
software systems to machine learning and 
quantum computing. 


Learn about calls for papers 
and submission details at 


www.computer.org/tc. 
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